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Abstract 

In this paper we show that the problem of identifying an edge (i, j) in a graph 
G such that there exists an optimal vertex cover S oi G containing exactly one 
of the nodes i and j is NP-hard. Such an edge is called a weak edge. We then 
develop a polynomial time approximation algorithm for the vertex cover problem 
with performance guarantee 2 — where a is an upper bound on a measure 
related to a weak edge of a graph. Further, we discuss a new relaxation of the 
vertex cover problem which is used in our approximation algorithm to obtain 
smaller values of a. We also obtain linear programming representations of the 
vertex cover problem for special graphs. Our results provide new insights into 
the approximability of the vertex cover problem - a long standing open problem. 

Keywords: vertex cover problem, approximation algorithm, LP-relaxation, weak edge 
reduction, NP-complete problems 

1 Introduction 

Let G = (V, E) be an undirected graph on the vertex set V = {1,2, ... ,n}. A vertex 
cover of G is a subset S of V such that each edge of G has at least one endpoint in S. 
The vertex cover problem (VCP) is to compute a vertex cover of smallest cardinality 
in G. The VCP is NP-hard on an arbitrary graph but solvable in polynomial time on 
a bipartite graph. A vertex cover S is said to be 7-optimal if 15*1 < 7|5'''| where 7 > 1 
and 5° is an optimal solution to the VCP. 
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It is well known that a 2-optimal vertex cover of a graph can be obtained in polyno- 
mial time by taking all the vertices of a maximal (not necessarily maximum) matching 
in the graph or rounding up the LP relaxation solution of an integer programming 
formulation [18]. There has been considerable work (see e.g. survey paper [Hj) on the 
problem over the past 30 years on finding a polynomial-time approximation algorithm 
with an improved performance guarantee. The current best known bound on the perfor- 
mance ratio of a polynomial time approximation algorithm for VCP is 2 — 0(;^/|=^) [12] • 
It is also known that computing a 7-optimal solution in polynomial time for VCP is 
NP-Hard for any 1 < 7 < lOy/E — 21 ~ 1.36 In fact, no polynomial-time (2 — e)- 
approximation algorithm is known for VCP for any constant e > and existence of 
such an algorithm is one of the most outstanding open questions in approximation algo- 
rithms for combinatorial optimization problems. Under the assumption that the unique 
game conjecture [9[ |13|, HI] is true, many researchers believe that a polynomial time 
(2 — e)-approximation algorithm with constant e > is not possible for VCP. For recent 
works on approximability of VCP, we refer to[Ill2|5l|6l[71|8l|10l|12l[l5l[l6]. Recently 
Asgeirsson and Stein [21 [3] reported extensive experimental results using a heuristic 
algorithm which obtained no worse than |-optimal solutions for all the test problems 
they considered. Also, Han, Punnen and Ye [S] proposed a (| + ^)-approximation algo- 
rithm for VCP, where ^ is an error parameter calculated by the algorithm and reported 
that no example was known where ^ 7^ 0. 

A natural integer programming formulation of VCP can be described as follows: 

En 

{IP) s.t. Xi + xj>l,{t,j) e E, (1) 
Xi e {0, l},i = 1,2, ■ ■ ■ ,n. 

Let X = {xi,X2, . . . , Xn) be an optimal solution to ([1]). Then Sjp = {i | = 1} is an 
optimal vertex cover of the graph G. The linear programming (LP) relaxation of the 
above integer program is 

(LPR) s.t. Xi + Xj > 1, e E, (2) 
Xi > 0,i = 1,2, ■ ■ ■ ,n. 

It is well known that (e.g. [17]) any optimal basic feasible solution (BFS) x* = 
{xl, X2, . . . , x^) to the problem LPR, satisfies x* e {0, |, 1}. Let Slp = {i \ x* = 
^ or X* = 1}, then it is easy to see that Slp is a 2-approximate solution to the VCP on 
graph G. Nemhauser and Trotter [IB] have further proved that there exists an optimal 
integer solution to ([1]), which agrees with x* in its integer components. 
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An G is said to be a weak edge if there exists an optimal vertex cover 

of G such that \V^ fl = 1. Likewise, an E E is said to be a strong 

edge if there exists an optimal vertex cover of G such that \V^ fl = 2. 

An edge is uniformly strong if n {i,i}| = 2 for any optimal vertex cover 

Note that it is possible for an edge to be both strong and weak. Also is 
uniformly strong if and only if it is not a weak edge. In this paper, we show that 
the problems of identifying a weak edge and identifying a strong edge are NP-hard. 
We also present a polynomial time (2 — ■^^)-SippYox\msition algorithm for VCP where 
a is an appropriate graph theoretic measure (to be introduced in Section 3). Thus 
for all graphs for which a bounded above by a constant, we have a polynomial time 
(2 — e)-approximation algorithm for VCP. So far we could not identify any class of 
graphs where a is anything but a constant. We also give examples of graphs satisfying 
the property that a = 0. However, establishing tight bounds on a, independent of 
graph structures remains an open question. VCP is trivial on a complete graph 
since any collection of n — 1 nodes serves as an optimal solution. However, the LPR 
gives an objective function value of | only. We give a stronger relaxation for VCP and 
complete linear programming description of VCP on a complete graph, wheels, among 
others. This relaxation can also be used to find reasonable expected guarantee for a. 

For any graph G, we sometimes use the notation V{G) to represent its vertex set 
and E{G) to represent its edge set. 

2 Complexity of weak and strong edge problems 

The strong edge problem can be stated as follows: "Given a graph, identify a strong 
edge of G or declare that no such edge exists." 

Theorem 1 The strong edge problem is NP-hard. 

Proof. If G is bipartite, then it does not contain a strong edge. If G is not bipartite, 
then it must contain an odd cycle. For any odd cycle cu, any vertex cover must contain 
at least two adjacent nodes of cu and hence G must contain at least one strong edge. 
If such an edge {i,j) can be identified in polynomial time, then after removing the 
nodes i and j from G and applying the algorithm on G — {i,j} and repeating the 
process we eventually reach a bipartite graph for which an optimal vertex cover V can 
be identified in polynomial time. Then V together with the nodes removed sofar will 
form an optimal vertex cover of G. Thus if the strong edge problem can be solved in 
polynomial time, then the VCP can be solved in polynomial time. ■ 

The problem of identifying a weak edge is much more interesting. The weak edge 
problem can be stated as follows: "Given a graph G, identify a weak edge of G." It 
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may be noted that unlike a strong edge, a weak edge exists for all graphs. We will now 
show that the weak edge problem is NP-hard. Before discussing the proof of this, we 
need to introduce some notations and definitions. 

Let optimal BPS of LPR, the linear programming re- 

laxation of the VCP. Let Iq = {i : x* = 0} and Ii = {i : x* = 1}. The graph 
G = G \ {Iq U Ii} is called the {0, l}-reduced graph of G. The process of computing G 
from G is called a {0,1} -reduction. 

Lemma 1 [TE^ If R is a vertex cover of G then RU Ii is a vertex cover of G. If R 
is optimal for G, then RU Ii is an optimal vertex cover for G. If R is an 'y-optimal 
vertex cover of G, then RU Ii is an 'y -optimal vertex cover of G for any 7 > 1. 

Let (z,j) be an edge of G. Define Aij = {k \ {i,k) G E{G) and {j,k) e E{G)}, 
D, = {se ViG) I it,s) e EiG),s ^ j,s A,,}, and = {t e ViG) \ G 
E(G),t ^ i,t ^ ^ij}- Construct the new graph G^^'^^ from G as follows. Prom graph 
G, delete Aij and all the incident edges, connect each vertex s G -Dj to each vertex 
t G Dj whenever such an edge is not already present, and delete vertices i and j 
with all the incident edges. The operation of constructing G^'^'^^ from G is called an 
{i, j) -reduction. When {i,j) is selected as a weak edge, then the corresponding {i,j)- 
reduction is called a weak edge reduction. The weak edge reduction is a modified version 
of the active edge reduction operation introduced in [8]. 



Pigure 1: Weak edge reduction using edge (1,2). Di = {8,4}, D2 = {7} and A12 = 
{3, 5}. The graph G^^''^^ is on the right. 
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Lemma 2 Let be a weak edge of G, RC V{G^^'^^) and 

^ (RUA,,U{j}, ifD.CR; 
\RU Aij U {i}, otherwise, 

1. If R is a vertex cover of G^^'^\ then R* is a vertex cover of G. 

2. If R is an optimal vertex cover of G^'^'^\ then R* is an optimal vertex cover of G. 

3. If R is a 'j-optimal vertex cover in G^^'^\ then R* is a 'j-optimal vertex cover in 
G for any 7 > 1 . 

Proof. If Di (1 R then all arcs in G incident on i, except possibly is covered by 

R. Then R* = RU Aij U {j} covers all arcs incident on j, including {i,j) and hence 
R* is a vertex cover in G. If at least one vertex of Di is not in R, then all vertices in 
Dj must be in R by construction of G^^'^\ Thus R* = RU A^- U {i} must be a vertex 
cover of G. 

Suppose R is an optimal vertex cover of G^^'^\ Since (i, j) is a weak edge, there 
exists an optimal vertex cover, say V^, of G containing exactly one of the nodes i or 
j. Without loss of generality, let this node be i. For each node k G A^j, {i,j,k) is a 
3-cycle in G and hence k & for all k G Aij. Let = — {{i} U Aij), which is a 
vertex cover of G^'^'^\ Then \R\ = \V^\ for otherwise if |-R| < \V^\ we have |-R*| < \V^\, 
a contradiction. Thus \R*\ = \V^\ establishing optimality of R*. 

Suppose R is an 7-optimal vertex cover of G*^*'-'^ and let V^^'^^ be an optimal vertex 
cover m G^'^^\ Thus 

< 7|V"(*'-'')| where 7 > L (3) 

Let be an optimal vertex cover in G. Without loss of generality assume i & and 
since {i,j) is weak, j ^ V^. Let = - {{i} U A,j). Then \V^\ = Thus we 

from ([3D, \R\ < -f\V^\. Thus 

\R*\ < l\V^\ + |A,,| + 1 < 7(|r^| + |A,,| + 1) < 7|y°|. 

Thus R* is 7-optimal in G. ■ 

Suppose that an oracle, say WEAK{G, i,j), is available which with input G out- 
puts two nodes i and j such that {i,j) is a weak edge of G. It may be noted that 
WEAK(G, i,j) do not tell us which node amongst i and j is in an optimal vertex cover. 
It simply identifies the weak edge {i,j). Using the oracle WEAK(G, we develop 
an algorithm, called weak edge reduction algorithm or WER-algorithm to compute an 
optimal vertex cover of G. 
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The basic idea of the scheme is very simple. We apply {0, 1} and weak edge re- 
ductions repeatedly until a null graph is reached, in which case the algorithm goes to 
a backtracking step. We record the vertices of the weak edge identified in each weak 
edge reduction step but do not determine which one to be included in the output ver- 
tex cover. In the backtrack step, taking guidance from lemma 2, we choose exactly 
one of these two vertices to form part of the vertex cover we construct. In this step, 
the algorithm computes a vertex cover for G using all vertices in Ajj removed in the 
weak edge reduction steps, vertices with value 1 removed in the {0, 1} reduction steps, 
and the selected vertices in the backtrack step from the vertices corresponding to the 
weak edges recorded during the weak edge reduction steps. A formal description of the 
WER-algorithm is given below. 

The WER- Algorithm 
Step 1: {* Initialize *} A; = 1, = G. 

Step 2: {* Reduction operations *} = 0, 4j = 0, {ik,jk) = 0- 

1. {* {O,l}-reduction *} Solve the LP relaxation problem LPR of VCP on 
the graph Gk- Let = {x'^ : i G V{Gk)} be the resulting optimal BPS, 
4,0 = {i I = 0}, 4,1 = {i I xf = 1}, and 4 = 4,o U 4,i. 

If V{Gk) \ 4 = goto Step 3 else Gk^Gk\ 4 endif 

2. {* weak edge reduction *} Call WEAK(Gfe,i,j) to identify the weak edge 

Let Gk+i = G^^'^\ where G^^'^^ is the graph obtained from Gk using 
the weak edge reduction operation. Compute A^ for Gk as defined in the 
weak edge reduction. Let A^ = Aij,ik = i,jk = j- 
If Gfc+i 7^ then k = k + 1 goto beginning of Step 2 endif 

Step 3: L=k+1, Sl = 0. 

Step 4: {* Backtracking to construct a solution *} 

Let Sl-1 = SlU 

If («L-i, jL-i) then Sl-1 = Sl-i U Ai_i U R*, where 



R* 




if C5i; 
otherwise. 



and = {s ■■ {iL-i, s) G Gl-i, s ^ ji-i, s ^ Ai_i} endif 

L = L - 1, 

If L 7^ 1 then goto beginning of step 4 else output 5*1 and STOP endif 
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Using Lemma 1 and Lemma 2, it can be verified that the output Si of the WER- 
algorithm is an optimal vertex cover of G. It is easy to verify that the complexity of 
the algorithm is polynomial whenever the complexity of WEAK(G, i, j) is polynomial. 
Since VCP is NP-hard we established the following theorem: 

Theorem 2 The weak edge problem is NP-hard. 

3 An approximation algorithm for VCP 

Let VCP(i, j) be the restricted vertex cover problem where feasible solutions are vertex 
covers of G using exactly one of the vertices from the set {i,j} and looking for the 
smallest vertex cover satisfying this property. More precisely, VCP{i,j) tries to identify 
a vertex cover V* of G with smallest cardinality such that \V* fl {«, j}| = 1. Let 6 and 
S{i,j) be the optimal objective function values of VCP and VCP(i, j) respectively. If 
is indeed a weak edge of G, then 6 = S{i,j). Otherwise, 

6{t,j) =6 + a{t,j), (4) 

where (j{i,j) is a non-negative integer. Further, using arguments similar to the proof 
of Lemma 2 it can be shown that 

Oj + A,, + 1 = 5{i,]) =5 + ail,]). (5) 

where is the optimal objective function value VCP on G^'''^\ 
Consider the optimization problem 

WEAK-OPT: Minimize a 

Subject to (i, j) G E{G) 

WEAK-OPT is precisely the weak edge problem in the optimization form and its 
optimal objective function value is always zero. However this problem is NP-hard by 
Theorem [21 We now show that an upper bound a on the optimal objective function 
value of WEAK-OPT and a solution (i, j) with (j{i,j) < a can be used to obtain a 
(2 — Y^)-approximation algorithm for VCP. Let ALMOST- WEAK(G', i, j) be an oracle 
which with input G computes an approximate solution (i, j) to WEAK-OPT such that 
(^{'i'li) ^ 0" for some cr. Consider the WER-algorithm with WEAK(G', i, j) replaced by 
ALMOST-WEAK(G,z, j). We call this the AWER-algorithm. 

Let Gfc, k = l,2,...t be the sequence of graphs generated in Step 2(2) of the 
AWER-algorithm and {ik,jk) be the approximate solution to WEAK-OPT on Gk, k = 
1,2, . . ., t identified by ALMOST- WEAK(G'fc, 
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Theorem 3 The A WE R- algorithm identifies a vertex cover Si such that \Si\ < (2 — 
Y^)!*^*! where S* is an optimal solution to the VCP. Further, the complexity of the 
the algorithm is 0{n{(j){n) + ipi^n)) where n = |V^(G)|, (f){n) is the complexity of LPR 
and ip{n) is the complexity of ALMOST-WEAK(G,i, j ). 

Proof. Without loss of generahty, we assume that the LPR solution = {xl,xl, . . . , xD 
generated when Step 2(1) is executed for the first time satisfies x} = | for all i. If this 
is not true, then we could replace G by a new graph G = (j\ {/i^i U/i_o} and by Lemma 
1, if S" is a 7-optimal solution for VCP on G then S U Ii^i is a 7-optimal solution on G 
for any 7 > 1. Thus, under this assumption we have 

n<2\S*\. (6) 

Let t be the total number of iterations of Step 2 (2). For simplicity of notation, we 
denote ak = cr{ik,jk) and 6k = S{ik,jk)- Note that 6k and 6k are optimal objective 
function values of VCP and VCP{ik, jk), respectively, on the graph Gk- In view of 
equations (jlj) and ([5]) we have, 

6k = 6k + (Tk, k = l,2,...,t (7) 

and 

4+i + |Ai,,,-J + |4,i| + l = 4, k = l,2,...,t. (8) 
From ([7]) and ([8]) we have 

4+1 - ^fc = o-fc - |Ai^jJ - - 1, k = l,2,...,t. (9) 

Adding equations in ([9]) for /c = 1, 2, . . . , t and using the fact that 6t+i = \It+i,i\, we 
have, 

t 

\Si\ = \S*\ + J2(^k, (10) 

k=l 

where \S*\ = 61, and by construction, 

l^il = 4t\4,i + SLiAfc + t. (11) 

But, 

mG)| = Sit\4 + SLiA, + 2t. (12) 
From (Uni), (HI]) and dnD, we have 



^ \ViG)\ - J:it\h - n=Ak ^ \ViG)\ - \S*\ - ^Uiak - 1) _ ^^3^ 
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From inequalities and (fT^ . we have 

\S*\-tia- 

where a = ''^^ *' . Then we have 



Thus, 

\S^\ _ \S*\ + J:Ua, _ \S*\+ta ^ ^ ^ _ 1 

\S*\ - a+1- a+1 1 + a' 

The complexity of the algorithm can easily be verified. ■ 

The performance bound established in Theorem [3] is useful only if we can find an ef- 
ficient way to implement our black-box oracle ALMOST- WEAK(G, that identifies 
a reasonable in each iteration. If ALMOST- WEAK(G', simply generates a 
random edge, then a{i, j) could be as large as 0{n) as given in the example of Figure 2. 
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Figure 2: A 3D-wheel with central axis (6, 7). 



For a 3D- wheel on n nodes with central axis {n — l,n), a{n — l,n) = [{f } — 2j . 
However, when is chosen as any other edge, cr{i,j) = 0. A trivial upper bound 
on 0" is ^ for any graph on n nodes. Let us now explore the possibilities of improving 
this trivial bound. 

Any vertex cover must contain at least s + 1 vertices of an odd cycle of length 
2s + 1. This motivates the following extended linear programming relaxation (ELP) of 
the VCP, studied in [Hll]. 



t < 



\S*\ 



a 



9 



n 

mill Xi 

i=l 



(ELP) Xi + Xj > e E, (^^4^ 

a^i > 0, = 1, 2, . . . , n, 

where f2 denotes the set of all odd-cycles of G and Uk & Q contains 2sk + 1 vertices for 
some integer Sk- Note that although there may be an exponential number of odd-cycles 
in G, since the odd cycle inequalities has a polynomial-time separation scheme, ELP 
is polynomially solvable. Further, it is possible to compute an optimal BFS of ELP in 
polynomial time. 

Let x^ be an optimal basic feasible solution of ELP. An edge (r, s) G -E is said to 
be an active edge with respect to x° if -|- x° = 1. There may or may not exist an 
active edge corresponding to an optimal BFS of the ELP as shown in [8]. For any arc 
(r, s), consider the restricted ELP (RELP(r, s)) as follows: 



mm Xi 

i=l 

s.t. Xi + Xj >l,{i,j) e E\{{r,s)}, 
{RELP{r,s)) xr + x, = l, (15) 

^Xi> Sk + l,ujk e Q, 

Xi > 0,i = 1,2, . . . ,n, 

Let Z{r, s) be the optimal objective function value of RELP(r, s). Choose {p, q) G E{G) 
such that 

Z{p,q) = mm{Z{i,3) : {i,j)eE{G)}. 

An optimal solution to RELP(p, q) is called a RELP solution. It may be noted that if 
an optimal solution x* of the ELP contains an active edge, then x* is also an RELP 
solution. Further Z{p, q) is a lower bound on the optimal objective function value of 
VCP. 

The VCP on a complete graph is trivial since any collection of (n — 1) nodes form an 
optimal vertex cover. However, for a complete graph, LPR yields an optimal objective 
function value of | only and ELP yields an optimal objective function value of 
Interestingly, the optimal objective function value of RELP on a complete graph is 
n — 1, and the RELP solution is indeed an optimal vertex cover on a complete graph. 
A stronger version of this observation is proved in the following theorem. 
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Theorem 4 For any G E{G), an optimal BFS of the linear program RELP(i, j ) 
gives an optimal vertex cover of G whenever G is a complete graph or a wheel. 



Proof. Suppose G is a complete graph with V{G) = {1,2, .. . ,n}. Without loss of 
generality assume {i,j) = (1,2). Thus 



Let optimal basic solution of RELP(z, j). Since x^ + a;^ = 1, 

by odd cycle inequalities, we have 



Hence = 1 for k = 3,4, . . . ,n yielding Z{i,j) = n — 1 . Now we have to establish that 
Xi and X2 cannot be fractional. If + x° + = 2 for any {r, s} ^ {1, 2} then x? = 
yielding Xg = 1. Similarly if Xg + x^ + x^ = 2 for any {r,s} ^ {1, 2} then Xg = yielding 
x\ = \. If x° + x° + Xj > 2 for all 3-cycles other than those in (fT71) it can be shown 
that there must exist an edge inequality, other than (fT6!) . satisfied as an equality. Such 
an equality must be of the form x'j' + x° = 1 or x^ + x° = 1 for r G {3, 4, . . . , n} and 
hence x\ and x\ can take only values zero or one. Both {2, 3, . . . , n} and {1,3,..., n} 
are optimal vertex covers for G. The proof for the case of a wheel can be obtained 
using similar analysis and we skip the details. ■ 

It may be noted that an optimal BFS of RELP(i, j) gives an optimal vertex cover 
on a 3D-wheel (Figure 2) when {i,f) is not the central axis. 

Extending the notion of an active edge corresponding to an ELP solution an 
edge & E is said to be an active edge with respect to an RELP solution x° if 

x° + x° = 1. Unlike ELP, an RELP solution always contains an active edge. In AWER- 
algorithm, the output of ALMOST- WEAK(G', can be selected as an active edge 
with respect to a RELP solution on G. 

We believe that the value of a{i,j), i.e. the absolute diff'erence between the optimal 
objective function value of VCP and the optimal objective function value of VCP(i, j), 
when {i,j) is an active edge corresponding to an RELP solution is a constant with 
very high probability, if not with probability one. If it is a constant, then our algo- 
rithm resolves the long standing question on the existence of a polynomial time 2 — e 
approximation algorithm for VCP for constant e > 0. It is an open question to ob- 
tain a tight bound, deterministic or probabilistic, on this interesting graph theoretic 
measure. Nevertheless, our results provide new insight into the approximability of the 
vertex cover problem. 




(16) 



CC-^ ~\~ x<2 ~\~ ^ 1^ — 2, k — 2, 3, • • • 5 



(17) 
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4 Conclusion 



In this paper, we proved that weak edge and strong edge problems are NP-hard. We 
also presented a polynomial time (2 — ^^)-approximation algorithm for VCP where a is 
a well defined graph theoretic measure. Obtaining tight upper bounds on cr, determin- 
istic or probabilistic, is an open question. We also provide simple linear programming 
representation of VCP on a complete graph, wheel and 3D-wheel, among other graphs. 
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